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Chaos control in Random Boolean networks is implemented by freezing part of the network to 
drive it from chaotic to ordered phase. However, controlled nodes are only viewed as passive blocks 
to prevent perturbation spread. This paper proposes a new control method in which controlled nodes 
can exert an active impact on the network. Controlled nodes and frozen values are deliberately 
selected according to the information of connection and Boolean functions. Simulation results show 
that the number of nodes needed to achieve control is largely reduced compared to previous method. 
Theoretical analysis is also given to estimate the least fraction of nodes needed to achieve control. 
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1. Introduction 

Random Boolean networks (RBNs) are an abstract model of genetic regulatory networks 
suggested by Kauffman p], promising to reveal some principles of living systems with a sys- 
tematic view. RBNs' application has been found in areas like sociology, neural networks and 
music generation (21 El Hj . 

The classical model of RBN is described as follows. There are N nodes, every node with 

K edges pointed from others. The state of every node takes Boolean value. State of a node 

in the next time step is determined by the states of its input nodes and an individual Boolean 

function (a lookup rule table). The probability for values in the lookup rule tables to take 1 is 

called the bias of the RBN, marked as p. Many other models are proposed considering different 

updating schemes, while in this paper we mainly focus on the classical model. 
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As specific networks can be extremely large and complex, studies concentrate on the behav- 
ior of certain ensembles of RBNs. For example, N, K and p are often fixed, and connections, 
rule tables and initial states supposed to be randomly picked at the beginning of revolution. 
With some stochastic approach, theoretical analysis can be given to such ensembles and verified 
by simulation with randomly generated RBNs. Then, the properties found in an ensemble can 
be applied to a particular system which is a member of it. 

Similar to other dynamic systems, phases of ordered, chaotic and critical can be found in 
RBNs, usually defined by properties of perturbation spread. The critical condition is one of 
the most important issues in RBNs. Another interesting topic is chaos control. A network in 
chaos phase may be driven into periodic behavior when states of a certain percentage of nodes 
are determined externally j5l El [7j. In previous work, nodes to be controlled and their frozen 
values are all picked up randomly. Our work shows that they can be deliberately selected to 
achieve high control efficiency. 

The rest of the paper is organized as follows: Section 2 introduces related work on RBN 
control. Section 3 explains the concept of canalizing and the basic idea of our method. Section 
4 elaborates the method and gives theoretical analysis. Simulation results are given in Section 
5. Section 6 makes conclusion and discuss about future work. 

2. Related work on RBN control 

We first define some symbols about RBN. Denote the state of node i at time t as Xj(t), and 
the Boolean function of node i as f\. The input nodes of node i, namely the nodes pointing to 
node i, are denoted as i\ r . . ,%k ■ RBN evolves in the way 

x i {t + l)=f i (x il {t),...,x iK (t)),i = l,2,...,N. (1) 

Ref.[6] gives a general form of RBN control. Define ^ max as the maximum percentage of 
controlled nodes. F T (t) is any positive function with periodic r, varying between and 1. Let 
7(0 = 7 ma:r -FV(£)> denoting the percentage of controlled nodes at time t. Before evolution, 
N / y max controlled nodes are picked up randomly from the network. For convenience we assume 
that their sequence numbers are 1, . . . , N^ max . Each controlled node is given a random value 
Ci E {0, 1}, 2 = 1,..., N^ max . At each time step t, the states of node 1, . . . , Nj max are fixed 
to their corresponding Ci, while the remaining controlled nodes are updated freely. For RBNs 
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in chaos phase, a <y maz large enough can drive the networks into ordered phase, which usually 
appear as periodic behavior with an integer multiple of r to be the period. 

To estimate the least ^ max needed to control the networks, an approximation method is 
used [8]. The basic idea is to assume that the state of one of the nodes flips, which introduces 
perturbation (or damage) into the network. If the state of a successor node flips at the next 
time step, we say the perturbation spreads (or percolates) to this successor node. If the mean 
probability that one damage spreads in a control period r is larger than 1, damage amplifies 
in the network, meaning the system is in chaos phase, and vice versa. Thus, letting the mean 
probability to be 1, we can get the critical condition of the system, corresponding to the least 
j max to achieve control. The critical condition is 

[2p(i-^rnV-T(*)) = i- (2) 

t=o 

This is because the probability for a free node to flip is p, and a node has K(l — 7(t)) free 
successor nodes in average. Letting ^ max = 0, we get the critical condition of free RBN: 

2p(l -p)K= 1. (3) 

In this analysis, controlled nodes only serve as passive blocks to prevent damage spread. 
However, as the driven power of the network, we expect they have active impact on the network. 
Moreover, controlled nodes and their values are chosen totally randomly. It can be expected 
that better efficiency be achieved if more information about the network is utilized in selection. 
In the following sections, the paper will show that this can be done by selecting the controlled 
nodes and their fixed values according to a certain measure. Before that, we need to introduce 
the concept of canalizing. 

3. Canalizing and the basic idea of our method 

A Boolean function is canalizing if, whenever one input node (canalizing input node) takes a 
certain value (canalizing value), the function always gives out the same output [3]. For example, 
x\orf{x2i . . • , xk) is a canalizing function because the function always yields 1 when x\ = 1. 

Our basic idea is: if a controlled node is a canalizing input node to one of its free successors, 
and it is fixed to the canalizing value, the successor node always yield the same output and 
is therefore insensitive to any perturbation. In this way, fixing a controlled node does not 
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only prevent itself from spreading damage, but does the same thing to at least one free node, 
changing it into a controlled node equivalently. 

As each node has several successor nodes, a controlled node's value should be fixed to the 
very value which is canalizing to more successor nodes. All nodes in the network can also be 
sorted according to such a measure. Those who freeze most free nodes should be selected as 
controlled nodes so that high control efficiency is achieved. 

We extend the concept of canalizing to a more general idea. Let the percentage of Is in 
node i's rule table be Ri. When one of i's input nodes, node j, is fixed to a constant value C, 
denote the percentage of Is in i's possible output values in the rule table as Ri\ Xj =c- Using 
these symbols, canalizing can be expressed as Ri\ Xj =c = or 1. If Ri is close to or 1, the 
node can also prevent damage spread efficiently as different inputs are likely to yield a same 
output (we will prove this in the next section). 

To fix a controlled node j to a certain value C means its successor node i's percentage of Is 
in possible output values changes from Ri to Ri\ Xj =c- Counting the canalizing successor nodes, 
which is mentioned above, does not take Ri\ Xj =c that is close to or 1 into account, and ignore 
the possible negative effect on successor nodes as Ri\ Xj =c may be more close to 0.5 than R { 
is. Based on this observation, we suggest our method which uses another measure instead of 
counting canalizing successor nodes. 



4. RBN control based on reducing mean damage perco- 
lation rate 

In eq.(3), mean damage percolation rate is calculated by 2p(l — p), which depends on a 
priori parameter p, the "genotype" of the network. For a given RBN, p can be unknown and 
mean damage percolation rate should be estimated by R iy the "phenotype" of the network. 

We adopt the same assumption that eq.(3) uses: at first the input of a node is randomly 
picked, and a flip of state in a certain input node makes the input changes randomly to another 
one. For node i, there are 2 K Ri Is and 2 K (1 — Ri) 0s in the rule table, thus the probability for 
node i to flip is 

fr ^rr + (1 - ^W^l = 2^=T^ (1 " Ri) - (4) 

As Ri(l — Ri) is a quadratic function with a maximum at Ri = 0.5, it proves the statement 



4 



in Section 3 that nodes with Ri close to or 1 can prevent damage spread effectively. Since 
damage percolation rate of a single node is obtained, the mean damage percolation rate of the 
network should be 

1 nK+l N 

1 N 

On the other hand, we calculate the expectation of — } Ri(l — Ri): 

^ i=i 

1 N 

E ^(1 - Ri)) = E(Ri(l -R l )=p-p 2 + Var(Ri). (6) 
iV i=i 



As Ri is Bernoulli distributed, 



Var(R,) = (7) 



Thus we have, 

i N 2 K — I 

E{„ E W - Ri)) = ~^p0- ~ P) ( 8 ) 

Y 2 K+1 N 

which shows that — ^ — R{) can be an estimation of 2p(l — p). 

From two different points of view, we reach the same conclusion that — — = Ri(l — Ri) 

N 2 — 1 z=1 

is the mean damage percolation rate of the network, which is also proved as a theorem in 
Ref. [in] • If it is reduced as much as possible, the network is likely to fall into ordered phase. 

We implement chaos control in RBNs with deliberate selection of controlled nodes and their 
fixed values as follows: let 0(i) be the set of output nodes of node i. Define 

Si,C — Ri(l — Ri) + zl [RjO- ~~ Rj) ~~ Rj\xi=c(l — Rj\xi=c)) (9) 
jeo(i) 

and 

Si = max{S ifi , Si,i}, (10) 

,0 if Si n > Si i 
Q={ l '° . (11) 

1 otherwise 

Sort all nodes according to their Si and a new order (1), . . . , (N) is obtained so that S^ > 
S(jj for any i < j. At every time step, use node (1), . . . , (Nj(t)) as controlled nodes and fix 
them to (7(i), . . . , C( N ). 

N 

When Xi is fixed to C, Ri(l — Ri) is removed from Y^i2^(l — R t ) and damage percolation 

i=i 

rate of all node z's output nodes are altered. 5$ c is in proportion to the amount that the mean 
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damage percolation rate of all rest nodes is reduced by fixing Xi = C . As C can be or 1, we 
choose the fixed value which makes more reduction. We select the nodes with largest Si to be 
controlled nodes so that the mean damage percolation rate of the network is reduced to least. 
The mean damage percolation rate at time t is 

nK+l N N-y(t) 

^tQXi-^)- 

,= V- 7 (t)) • (12) 

And the critical condition is 

nif+l N Ny(t) 

ri( — - — k — - — ) = 1 - ( 13 ) 

t=0 iV 

We should mention that such analysis does not take into account the situation that a free 
node has two or more controlled nodes as input nodes. However, this is rare when Kj(t) < 1, 
which is ignored here. 

To estimate the least r f nax needed to implement control is difficult because the out-degrees 
of the nodes vary as the out-degree of a single node has a standard deviation of nearly \fk. 
Therefore, the distribution of Si is hard to obtain. In addition, Si has strong correlation with 
each other, which makes it difficult to calculate the expectation of S^y 

We give a rough estimation of r ) max by adopting the following assumptions: (1) S^c of 
different i are independent with each other, and so are the out-degrees. (2) Consider S it c and 
out-degrees of a certain network which exactly fits the presumed distribution, and calculate 

AT 7 (t) 

S'(j) of such a network as an estimation. We explain our assumptions below in detail. 

i=i 

Let Oi = and O be a random variable independent identically distributed (i.i.d.) 

K K 

with O-i. O is Bernoulli distributed with parameter (N — 1, — — -). As (N — 1)— — - = K, 
when is very large, O is approximately Poisson distributed with X — K, which means 

P(0 = n) w ^-e~ K . (14) 

Assumption (2) means, we consider a network which has P{0 = n) • N nodes with out-degree 
n. 

Let Xi = Ri(l-Ri). For j e 0(i), X iJtC = Rj{l-Rj)-Rj\ Xl =c{^-Rj\x l= c)- The meaning 
of assumption (1) is, X it j t c of different i,j are independent with each other and Xi. However, 
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Xijfi and X^i are not independent because both of them rely merely on a same rule table. 
Here we have 

s i,c = Xi + X i,jfi- ( 15 ) 
jeO(i) 

For Oi = n, let So= n ,c be an random variable i.i.d. with S^c- Let Xc=o and Xc=\ be 
two random variables dependent on a same rule table, i.i.d. with X^q and Xij^. Since the 
distribution of Xc=o and Xq=i can be obtained by enumerating possible rule tables of a node, 
the distribution of So= n ,c can also be obtained by iteration 

P{So=n+l,0 — s 0> So=n+l,l = s l) 

= £ P(S O =n,0 = so - 00, Sb=n,l = *i - 6i)P(X c=0 = , X c=1 = (16) 

00,01 

For Oj = n, let S'o=n a random variable i.i.d. with Si. Its distribution can be obtained by 

P(S =n = S) 

= £ P(S O =n,0 = S, S 0=n ,l = + E ^(^O=n,0 = s', 5 0=n ,l = s). (17) 
s>s' s>s' 

In the very network we consider, the number of nodes whose Si — s is approximately 

N-l 

P{0 = n)P(S 0=n = s)N. (18) 

n=0 

Ny(t) 

So far, we are able to estimate S^ with assumption (2). For K = 3, p = 0.5 and 

i=l 



K = 3, p = 0.3, we calculate the estimation of ffffl , m = 1,2, ...,N and that of a real 

i=l 

network, shown in Fig.l. For normalization, the X and Y axes are both divided by N. It can 
be seen that the estimation is accurate, especially when m is not large. 



5. Simulation results 

To compare our method with previous work, we perform 100 simulations for each (j> ) ^ max ) 
with N = 1000, K = 3. F T (t) = sin 2 (nt/T) and r = 50. We count within how many of 100 
simulations the network is driven into periodic behavior and plot the fractions on Fig.2. If 100 
simulations all succeed in control, the small rectangle at (j>^ max ) is painted white. If they all 
fail, the rectangle is painted black. Intermediate fractions are painted according to the color 



7 



° 0.1 

£ 

» 0.06 




o 0.1 

E 

ra 0.05 



2 



0.4 0.6 
m/N 

(a) p=0.5 



2 0.4 0.6 

m/N 



Li 3 



(b) p-0.7 



Figure 1: Curves of sum of m largest Si are given, (a) is drawn under p = 0.5 and (b) is under 
p = 0.3. The solid line is theoretical estimation, and the dashed line is calculated from a real 
network. 



bar shown at the right of the plots. As can be seen, the least ^ max needed to implement control 
is largely reduced. Theoretical bounds of the two methods are given by the dot lines. 



6. Conclusion and future work 

In this paper, we propose a new control method in RBNs. Inspired by the concept of 
canalizing, we use the controlled nodes not only as passive blocks to prevent damage spread, 
but to exert active influence on the network by altering the property of its free nodes. Then we 
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(a) Method in Ref.[5] (b) Our method 

Figure 2: Simulation result and theoretical bound of two methods. 
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develop a measure with more generality and clearer physical meaning than counting canalizing 
nodes, and propose our method based on such a measure. Besides controlled nodes, the method 
determines the fixed values. The innovation of this method is that we make full use of connection 
information. More precise measure could be used to replace mean damage percolation rate 
suggested in this paper, while the idea that fixing node has an influence on successor nodes 
could be remained to form a new method. However, this idea also results in the difficulties in 
estimating the least ^ max needed to achieve control. Independence assumptions are adopted to 
give a rough estimation. 

For future work, more information could be utilized to realize efficient control. Although we 
have already made most of connection and Boolean functions information, our control method 
fails to consider the dynamic states of the system, which means it is an open-loop control 
method. A measure of percolation rate such as sensitivity [JO] which is related to RBN's 
current state could be used in selecting fixed values which vary with time, implementing a 
close-loop control. In addition, selection of controlled nodes should consider the topology of 
the network. Fixing specific nodes to build blocks could separate sensitive nodes apart to 
prevent damage spread cascading [IT] . Finally, math tool is to be found to accurately analyze 
the method that connection information usage causes difficulty in calculating expectation of 
strong correlated sorted statistics. 
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